Premium Forecasting of an Insurance Company: Automobile Insurance 
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We present an analytical study of an insurance company. We model the company's performance 
on a statistical basis and evaluate the predicted annual income of the company in terms of insurance 
parameters namely the premium, total number of the insured, average loss claims etc. We restrict 
ourselves to a single insurance class the so-called automobile insurance . We show the existence a 
crossover premium p c below which the company is loss- making. Above p c , we also give detailed 
statistical analysis of the company's financial status and obtain the predicted profit along with the 
corresponding risk as well as ruin probability in terms of premium. Furthermore we obtain the 
optimal premium p op t which maximizes the company's profit. 

PACS numbers: 05.45.Tp, 05.40.-a, 89.90+n 
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I. INTRODUCTION 



Rapidly expanding field of interdisciplinary physics 
has witnessed the desire of statistical physicists to ap- 
ply their knowledge to other areas of analytical sciences 
which till recently used to fall outside the traditional do- 
main of physics. A broad class of systems exhibiting 
complex dynamics, associated with the presence of many 
interacting constituents, has shown to be successfully in- 
vestigated with the concepts and techniqueSppf statistical 
physics. Protein folding, financial marketsErB, flow of ve- 
hicular traffic and granular material!], surface growth of 
interfaces are a few examples of numerous applications 
of physical methods to the paradigm of inter-disciplinary 
researches. In particular, economic systems has remark- 
ably spurred the interest of statistical physics commu- 
nity. Specifically, stock markets has been the main at- 
tractor of these interests and nowadays there is relatively 
a rich amount of results, both analytical and numerical on 
modelling of financial markets. Large number of traders 
with a diversity of trading strategies, conflicting inter- 
ests and heterogeneous anticipations are generic aspects 
of financial market leading to unpredictable price fluctua- 
tions and other out of equilibrium phenomena which have 
challenged physicists as well as economists for explana- 
tion. Nevertheless, stock markets constitute only a part 
of economy and there are other major financial systems 
which can be studied in the context of statistical physics. 
In this paper, we present both analytical and numerical 
investigations of one such economic systems: an insur- 
ance market . Human beings arc continuously confronted 
to different types of dangers which threaten his/her life 
in many aspects. Illness, earthquake, car accident, bur- 
glary, death and many others are simple examples of in- 
evitable dangers to which the human life is exposed. The 
occurrence of these adverse events is random in nature 
and frequently it is impossible to predict even the prob- 
ability of their occurrence. Insurance is way to provide 



guarantee of compensation for the losses of these dam- 
ages. The purpose of insurance is to indemnify policy 
holders against the occurrence of adverse events. There 
is tremendously variety in the events that are covered by 
insurance. Common to all insurance systems, there ex- 
ists a risk i.e., a condition in which there is a possibility 
of an adverse deviation from an expected outcome. Some 
or all of the risk is transferred from the insured to the 
insurer with an expectation that through the pooling of 
risks, the insurer will improve the estimation of expected 
total losses. In this paper we aim to model the perfor- 
mance of a virtual insurance company in order to find a 
better insight into the stochastic nature of the problem. 



II. FORMULATION OF THE MODEL 



We now present our model which simulates the per- 
formance of an insurance company. In this paper we 
restrict ourselves to a single insurance class the so-called 
automobile insurance. In principle, a number of clients 
(policy holders) insure their cars against car accidents. 
The company issues the corresponding insurance policies 
and receives an annual premium p from the policy hold- 
ers. Generally speaking, the amount of premium depends 
on many characteristics of the generic car such as type, 
model, production year etc. However for the sake of sim- 
plicity, in this paper we restrict ourselves to a category of 
cars for which the premium is almost a constant amount. 
The company's performance is modelled by the series of 
loss events which are randomly occurred in the course of 
time. Each loss event i.e.; damages to cars due to acci- 
dents, is incorporated with a loss amount which should 
be regarded as the amount of loss confirmed by company 
for each accident. The loss size is itself a random variable 
ranging from tiny to large amounts. Upon occurring a 
loss event, the company pays the claim amount to the 
policy holder which reduces the company's capital. On 
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the other hand, the premium income which is injected to 
the company raises its capital. It is the competition of 
these two factors which determines the financial status 
of the company. A low frequency of loss events together 
with a high premium income makes the company prof- 
itable while a low premium income and high frequency 
of loss events gives rise to a ruin state. Evidently the 
amount of premium plays a dominant role in insurance 
industry . The task of premium calculations and the 
related strategies have been challenging and controver- 
sial subjects for insurance companies andrrati drastically 
affect the financial status of the companyQ'BO. In a com- 
petitive economy, there are plenty of insurance companies 
each of which offer their own premium. If a company in- 
creases the premium, the number of clients willing to in- 
sure their cars decreases (due to competition of the other 
companies offering lower premiums). In other words, the 
company's attempt to raise its income through increas- 
ing the premium may fail due to decrease of the number 
of insured which obviously reduces the premium income. 
Therefore, increasing the premium is a highly risky ac- 
tion and it is of prime importance for the company's man- 
agers to have an estimation of the risk amount. Thus it is 
indispensable for a company to have a quantified knowl- 
edge on how the premium variations affect the long-term 
profit. To be more specific, in each day, a random num- 
ber of clients insure their car in the company. We denote 
this number by X t for t-th day. We assume that the num- 
ber of daily policy issued obeys a simple statistics with 
premium-dependent characteristics such as mean, stan- 
dard deviation etc. In this paper we assume that T t is 
uniformly drawn from the interval [a, b] with b = t5 and 
a = \ such that X t = T for all t. As a simple choice, we 

- — P-PQ 

take an exponential form X(p) — Xo e ™ T for the aver- 
age number of daily policy holders, po is an arbitrary ref- 
erence point, 2o is average number of insured for p = pa 
and the parameter r measures the predicted fall of the 
number of policy holders upon increasing the premium. 
For instance r = 0.45 corresponds to a 20 percent fall 
upon a 10 percent premium increase over p n . We now 
discuss on the distribution of the daily number of loss 
events in t— th day which is denoted by At- Evidently 
it implicitly depends on the previous number of issued 
policies X)i'<t-^t'- However, for the sake of simplicity, 
we make a simple assumption such that At = 9X where 
constant 9 measures the ratio of loss events to number of 
issued policies at an average level. We recall that 9 can be 
interpreted as the probability of occurrence of loss event 
(per car). We note that 9 can be estimated via empirical 
data of realistic insurance companies. Analogous to T tl it 
is assumed that At is uniformly distributed in the inter- 
val [a9, b9] such that At = 91 for all t. The loss amount 
itself should be considered as a stochastic variable. In 
this paper we adopt Erlang's modetrtj for statistical de- 
scription of loss amounts. According to this model, the 
distribution of loss amount obeys an exponential function 
of the form where £ denotes the average loss. 



III. STATISTICAL DESCRIPTION 

In this section we present the analytical results which 
give the annual capital of the company in terms of insur- 
ance characteristics i.e., premium, statistics of the num- 
ber of insured, statistics of loss events etc. For this pur- 
pose, in each working day we randomly draw two integer 
number X and A as the numbers of issued policies and 
loss events. Correspondingly, for each loss event, we at- 
tribute a random loss amount (drawn from Erlang distri- 
bution function) . The sum of loss amounts gives us the 
daily loss which is denoted by C. Denoting the number 
of annual working days by Af and the capital at the end 
of t-th day by Ct , we simply have 

C t = C t -i + pit - £t (1) 

where t denotes the day number. Focusing our atten- 
tion on the long-term profit, we now evaluate the average 
long-term capital of the company for a period of Af work- 
ing days. From the above recursive relation, one simply 
finds by iteration: 

AT A/" 

C^ = C +pY, I t-J2 C t ( 2 ) 

t=i t=i 

which determines the long-term capital in terms of aggre- 
gate loss and premium income. In order to predict the 
capital, we now average the above relation over many 
virtual realisations of future each of which corresponds 
to Af working days. Let us first define such predictive 
averaging process (hereafter referred to averaging ) for a 
generic variable tpt which for instance could be the num- 
ber of accidents or the number of issued policies in t-th 
day. We define the average of ipt over M virtual runs as 
follows: 

<^t>:=^[4 1} + --- + 4 M) } (3) 

Where each ipj ' is a random number drawn from a spec- 
ified distribution function. It can easily be verified that 
for the quantities X t and At the averaging does not de- 
pend on day number t. To see this explicitly let us eval- 
uate the average of the number of issued policies in the 
t-th day. According to the above definition we have: 

<^>=^ t (1) + ---+4 M) ] (4) 

The term in the bracket is the sum of M realisations 
of the stochastic variable T t and provided M is large 
enough, the sum simply converges to the average of the 
distribution function which is 1 for all days. Concerning 
this fact, we now average over the long-term capital and 
obtain the following relation where we have dropped the 
initial capital Co for simplicity. 
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< C N >= Af(p <!>-<£>) 



(5) 



We define the critical value of premium p c at which the 
average long-term profit equals zero < Cj\f >= 0. It is 
worth obtaining < C >t in terms of statistics of loss 
events. The loss amount in the t-th day is an extended 
random variable. By "extended" we mean that there 
are two sources of randomness. Firstly the number of 
loss events and secondly the loss amount for each loss 
event. In the appendix, it is shown, in details, that the 
average loss amount for each day is simply the average 
loss amount, in each single loss event, multiplied by the 
average number of daily loss events i.e., < C t >= £A. 
Concerning the above considerations one obtains the fol- 
lowing expression for the averaged long term capital : 



< C >=AfI e —{p-6£) 



(6) 



In our study, the numerical values of insurance parame- 
ters have been set from the empirical data taken from Ira- 
nian Insurance Industry for the year 2000. Specifically, 
we analysed the data taken from Iran Insurance Com- 
pany the largest company operating in Iran. The empir- 
ical data were: 1,819,935 insurance policy issued, 348961 
loss events leading to 739.7 billion Rls loss. The premium 
amount was 378000 Rls on average. Based on the above 
data, we set our parameter as: 9 = 0.19, £ = 2.12 million 
Rls and Zq = 6070. Also we take N — 300 active days. 
The following graph shows the behaviour of < C > for 
different values of r. 
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Fig. 1: Averaged long term profit of the virtual company 
for three values of r: 0.45, 0.35 and 0.28 which correspond- 
ingly refer to 20, 25 and 30 percent fall in the number of 
issued policies upon increasing the premium by 10 percent. 



which the company profits. If the premium is below p Cl 
the company is loss-making and in the vicinity of the 
crossover point, the company is profitless. The above 
graph gives us another useful knowledge. It determines 
the optimal premium p opt at which the company's cap- 
ital is maximized. However, we should note that the 
aforementioned conclusions are based on mean field ap- 
proach and in principle the company can not rely on 
these average-based arguments. They need a quantified 
insight into the risk of premium variations. A simple 
yet practical quantity which can give us an approximate 
measurement of the risk is the variance of long term 
profit in the predictive averaging processQ. At this stage, 
we try to evaluate the variance of the long-term capital 
C =< (C — C) 2 > where for simplicity we have dropped 
the index Af from the capital index. After some straight- 
forward mathematics we reach to the following formula: 



< (C-C) 2 >= p 2 



E 



< I t I t > > -p 2 M 2 l 2 + 2pN 2 T 2 iO 



-2p 



AT 

E 

i,i'=l 



< ItCtf > 



E 

t,f=i 



< CtCt, > ~M 2 T 2 i 2 d 2 (7) 



It should be noted that even in the case where our 
stochastic variables have simple distribution functions, 
it is not yet an easy task to evaluate the correlation 
functions in the above formula. Recall the definition of 
< Itl t ' > one simply finds: 



<IfXi, 



1 



>:= —12 

M l 1 



(i)j(i) 



j(JW)j(M)j 



(8) 



which obviously differs from < T t ><X t r >. To proceed 
further, we separate the terms t = t' in the above relation 
and use the mean-field approximation in the remaining 
terms t ^ t' to substitute all the terms such as < 2 t Xf > 
by < It >< It' >■ We thus obtain the following relation 
for variance of long term capital. 

< {C-Cf >=N[p 2 {<I 2 > -I 2 )-2p{<l£ > -I 2 £,9) + 

< C 2 > -I 2 £ 2 9 2 } (9) 

The stochastic variable I has a uniform distribution func- 
tion therefore its variance is simply obtained via the rela- 
tion o~ 2 = j^(b — a) 2 where [a, b] denotes the interval over 
which the variable is distributed. We recall that b — a = X 
for X. One can simply show that the variance of daily loss 
is 6 times the variance of I. Recalling the definition of 
the covariance between two arbitrary stochastic variables 
X and Y: 



Cov(X,Y) 



< XY > - < X ><Y > 



(10) 



Accordingly, two distinct regimes are identified: profit 
and loss. There is crossover premium p c = £9 above 



It can be shown that inequality — 1 < Cov(X,Y) < 1 
holds. We thus write the remaining term in the above 
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equation as toajac where — 1 < u> < 1 denoted the co- 
variance between I and C. Putting everything together 
we simply arrive at the following approximate equation 
for the variance of long term capital: 



< (C-Cf >= 



MP 
12 



[ P 2 - 2put0 + rr] 



(11) 



It is seen that risk value has a quadratic dependence on 
premium p and is proportional to the number of working 
days N. 

The following graph depicts the behaviour of standard 
deviation as a function of premium for some values of r: 
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Fig. 3: mean-field standard deviation of annual capital for 
some values of £. N = 300 and u) — 0.3 and r = 0.45 . 

The next concept we deal with is the bankruptcy. A 
useful quantity in risk management is the company's ruin 
probability. This can be expressed as the probability 
that the long term capital falls below zero i.e., P(C < 0). 
To estimate the ruin probability, we use of the so-called 
Tchebycheff inequality!!! The Tchebycheff inequality es- 
tablishes a relation between the variance and the proba- 
bility that a stochastic variable, with finite average and 
variance, can deviate by an arbitrary amount e (e > 0) 
from the mean value: 



< Z > I > el < 



Fig. 2: mean-field standard deviation of annual capital for 
same three values of r as in fig.l . N = 300, £ = 2.12 million 
RIs. and ui — 0.8. 

It is observed that risk value is an increasing function 
of t. This indicates that for larger values of t which cor- 
respond to sharper decrease of insured number, we have 
a less-valued variance in long term capital. This means 
that company makes less profit but with more certainty, 
a situation which is desirable for cautious managers who 
avoid risky decisions. It would also be advantageous to 
look at the dependence of capital variance on loss size av- 
erage £. We recall that there are two principal sources of 
uncertainty in the in- flow/out-flow of the company's in- 
come. The first one is related to the predictive number of 
insured, approximated by the parameter r in our model, 
which is the main in-flow portion of the company's capi- 
tal. The second one is incorporated with out-flow portion 
which is dominated by loss size average £. The following 
graph exhibits the long term capital variance for various 
values of £. 



(12) 



where az denotes the standard deviation of the stochas- 
tic variable Z. Now let us estimate the probability that 
for a prescribed p, the long term profit equals — \C(p) 
where the dimension-less parameter A > denotes the 
depth of ruin. According to Tchebycheff inequality if we 
take Z as C one simply finds: 



P(C = -AC) * l -P{\C 



C\ >C(1 + A)) = 



2C 2 (1 



A) 2 
(13) 



Using equations (9) and (6) for a and C one obtains the 
upper limit of ruin probability in terms of p and A. 



P{C = -AC) * 



2pu9£ + 6> 2 £ 2 



24A/Ip-^) 2 (1 + A) 2 



(14) 



The following graph exhibits the ruin probability depen- 
dence on premium for various ruin depths. 
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IV. SUMMARY AND CONCLUDING REMARKS 
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Fig. 4: estimated ruin probability according to Tcheby- 
cheff inequality. A determines the ruin depth. Here A = 0.1. 
The rest of the parameter are remained unchanged as in fig. 2 

In order to find a deeper insight into the stochastic na- 
ture of the problem, we have carried out simulations to 
obtain the variance of C. We have simulated the com- 
pany's performance for a year and obtain the annual in- 
come C(W). Figure five exhibits the simulated variance 
of annual capital as a function of premium. 
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Fig. 5: simulated standard deviation of annual capital for 
the same values of r. N = 300, £ = 2.12 million RIs. 

The simulation results show a notable deviation from 
those of mean- field approach and gives w w 0.5 as the 
best fitting value for the covariance parameter u>. 



In this paper, we have tried to present , in some de- 
tail, a set of statistical facts emerging from the study of 
a virtual insurance company. The financial operations of 
an insurer can be viewed in terms of of a scries of cash 
inflows and outflows. Premiums and income from invest- 
ments, together with certain other income, are added to 
the reservoir of the assets, while the reservoir is depleted 
by claim payments, expenses of running the business, 
taxes and possibly other items of outflow. From a prac- 
tical point of view, it is of prime importance for every 
insurance company to have a quantified measurement of 
the risk of insuring . In this work, we have modelled the 
performance of a an insurance company both analytically 
and numerically. Our results illustrates the dependence 
of long-term profit in terms of insurance parameters es- 
pecially the premium. We have managed to give a quan- 
tified estimation for the risk of premium increasing. 

Finally we should point out some issues we have not 
discussed here. The occurrence of loss events and its sta- 
tistical features is the prominent factor affecting the com- 
pany's long term profit. Throughout the paper we have 
assumed that the loss size distribution obeys a simple 
statistics i.e. an exponential distribution function e~(. 
This is evidently a localized distribution function with 
finite mean and variance £ and £ 2 respectively. However, 
the main source of ruin in insurance industry is the occur- 
rence of catastrophic events corresponding to overwhelm- 
ing loss sizes. Obviously ordinary distribution functions 
such as Erlang fail to model these extreme events. Inves- 
tigation on this issue needs more exploration. 
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VI. APPENDIX 

In the section we obtain the relation < C t >— £A. In 
order to numerically calculate the average of loss amount 



(i) 



from 



in t-th day, we draw / random numbers n. 
a uniform distribution function where nf 1 denotes the 
number of car accidents in the i-th virtual realisation of 
t-th day. Let denote the loss amount due to the 



i-th accident in the j-th virtual realisation of t-th day. 
Accordingly we have : 



(15) 



where for simplicity we have dropped the day index t. 
The above relation can be rearranged as: 



(0 *m + . . . 4. *!'.) 



7.(1) 



+ ••• + 



n y ' z 



I 



(16) 



Concerning the fact that each n, is considerably large, 
each sum converges to < Z >~ £ therefore we have: 



<£>=£[- 



>W + ---+nW 

I J 



(17) 



Now the term in the bracket is simply the average of 
A therefore we have the relation < £ >= t;A 
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